Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature
Identifieur interne : 000616 ( Main/Exploration ); précédent : 000615; suivant : 000617Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature
Auteurs : Pedro Coelho [États-Unis] ; Amr Ahmed [États-Unis] ; Andrew Arnold [États-Unis] ; Joshua Kangas [États-Unis] ; Abdul-Saboor Sheikh [États-Unis] ; P. Xing [États-Unis] ; W. Cohen [États-Unis] ; F. Murphy [États-Unis]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2010.
Abstract
Abstract: Slif uses a combination of text-mining and image processing to extract information from figures in the biomedical literature. It also uses innovative extensions to traditional latent topic modeling to provide new ways to traverse the literature. Slif provides a publicly available searchable database (http://slif.cbi.cmu.edu). Slif originally focused on fluorescence microscopy images. We have now extended it to classify panels into more image types. We also improved the classification into subcellular classes by building a more representative training set. To get the most out of the human labeling effort, we used active learning to select images to label. We developed models that take into account the structure of the document (with panels inside figures inside papers) and the multi-modality of the information (free and annotated text, images, information from external databases). This has allowed us to provide new ways to navigate a large collection of documents.
Url:
DOI: 10.1007/978-3-642-13131-8_4
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000684
- to stream Istex, to step Curation: 000676
- to stream Istex, to step Checkpoint: 000196
- to stream Main, to step Merge: 000621
- to stream Main, to step Curation: 000616
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct:series"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature</title>
<author><name sortKey="Coelho, Pedro" sort="Coelho, Pedro" uniqKey="Coelho P" first="Pedro" last="Coelho">Pedro Coelho</name>
</author>
<author><name sortKey="Ahmed, Amr" sort="Ahmed, Amr" uniqKey="Ahmed A" first="Amr" last="Ahmed">Amr Ahmed</name>
</author>
<author><name sortKey="Arnold, Andrew" sort="Arnold, Andrew" uniqKey="Arnold A" first="Andrew" last="Arnold">Andrew Arnold</name>
</author>
<author><name sortKey="Kangas, Joshua" sort="Kangas, Joshua" uniqKey="Kangas J" first="Joshua" last="Kangas">Joshua Kangas</name>
</author>
<author><name sortKey="Sheikh, Abdul Saboor" sort="Sheikh, Abdul Saboor" uniqKey="Sheikh A" first="Abdul-Saboor" last="Sheikh">Abdul-Saboor Sheikh</name>
</author>
<author><name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
</author>
<author><name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
</author>
<author><name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:AE5DBE1CCB5FEE2907CEDC3DD02F2B7AFBA41CA4</idno>
<date when="2010" year="2010">2010</date>
<idno type="doi">10.1007/978-3-642-13131-8_4</idno>
<idno type="url">https://api.istex.fr/document/AE5DBE1CCB5FEE2907CEDC3DD02F2B7AFBA41CA4/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000684</idno>
<idno type="wicri:Area/Istex/Curation">000676</idno>
<idno type="wicri:Area/Istex/Checkpoint">000196</idno>
<idno type="wicri:doubleKey">0302-9743:2010:Coelho P:structured:literature:image</idno>
<idno type="wicri:Area/Main/Merge">000621</idno>
<idno type="wicri:Area/Main/Curation">000616</idno>
<idno type="wicri:Area/Main/Exploration">000616</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature</title>
<author><name sortKey="Coelho, Pedro" sort="Coelho, Pedro" uniqKey="Coelho P" first="Pedro" last="Coelho">Pedro Coelho</name>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation><wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author><name sortKey="Ahmed, Amr" sort="Ahmed, Amr" uniqKey="Ahmed A" first="Amr" last="Ahmed">Amr Ahmed</name>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author><name sortKey="Arnold, Andrew" sort="Arnold, Andrew" uniqKey="Arnold A" first="Andrew" last="Arnold">Andrew Arnold</name>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author><name sortKey="Kangas, Joshua" sort="Kangas, Joshua" uniqKey="Kangas J" first="Joshua" last="Kangas">Joshua Kangas</name>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation><wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author><name sortKey="Sheikh, Abdul Saboor" sort="Sheikh, Abdul Saboor" uniqKey="Sheikh A" first="Abdul-Saboor" last="Sheikh">Abdul-Saboor Sheikh</name>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author><name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation><wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author><name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation><wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author><name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation><wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4"><country>États-Unis</country>
<placeName><settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2010</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">AE5DBE1CCB5FEE2907CEDC3DD02F2B7AFBA41CA4</idno>
<idno type="DOI">10.1007/978-3-642-13131-8_4</idno>
<idno type="ChapterID">4</idno>
<idno type="ChapterID">Chap4</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Slif uses a combination of text-mining and image processing to extract information from figures in the biomedical literature. It also uses innovative extensions to traditional latent topic modeling to provide new ways to traverse the literature. Slif provides a publicly available searchable database (http://slif.cbi.cmu.edu). Slif originally focused on fluorescence microscopy images. We have now extended it to classify panels into more image types. We also improved the classification into subcellular classes by building a more representative training set. To get the most out of the human labeling effort, we used active learning to select images to label. We developed models that take into account the structure of the document (with panels inside figures inside papers) and the multi-modality of the information (free and annotated text, images, information from external databases). This has allowed us to provide new ways to navigate a large collection of documents.</div>
</front>
</TEI>
<affiliations><list><country><li>États-Unis</li>
</country>
<region><li>Pennsylvanie</li>
</region>
<settlement><li>Pittsburgh</li>
</settlement>
<orgName><li>Université Carnegie-Mellon</li>
</orgName>
</list>
<tree><country name="États-Unis"><region name="Pennsylvanie"><name sortKey="Coelho, Pedro" sort="Coelho, Pedro" uniqKey="Coelho P" first="Pedro" last="Coelho">Pedro Coelho</name>
</region>
<name sortKey="Ahmed, Amr" sort="Ahmed, Amr" uniqKey="Ahmed A" first="Amr" last="Ahmed">Amr Ahmed</name>
<name sortKey="Ahmed, Amr" sort="Ahmed, Amr" uniqKey="Ahmed A" first="Amr" last="Ahmed">Amr Ahmed</name>
<name sortKey="Arnold, Andrew" sort="Arnold, Andrew" uniqKey="Arnold A" first="Andrew" last="Arnold">Andrew Arnold</name>
<name sortKey="Coelho, Pedro" sort="Coelho, Pedro" uniqKey="Coelho P" first="Pedro" last="Coelho">Pedro Coelho</name>
<name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
<name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
<name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
<name sortKey="Kangas, Joshua" sort="Kangas, Joshua" uniqKey="Kangas J" first="Joshua" last="Kangas">Joshua Kangas</name>
<name sortKey="Kangas, Joshua" sort="Kangas, Joshua" uniqKey="Kangas J" first="Joshua" last="Kangas">Joshua Kangas</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Sheikh, Abdul Saboor" sort="Sheikh, Abdul Saboor" uniqKey="Sheikh A" first="Abdul-Saboor" last="Sheikh">Abdul-Saboor Sheikh</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000616 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000616 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:AE5DBE1CCB5FEE2907CEDC3DD02F2B7AFBA41CA4 |texte= Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature }}
This area was generated with Dilib version V0.6.32. |